The components of empirical multifractality in financial returns 



Wei-Xing Zhou 1 - 2 ' 3 ' 4 ( a ) 



O 
O 

(N 
-i— > 
o 

O 

m 

H 

<N 
> 
On 
00 
O 

00 

o 
o> 
o 



School of Business, East China University of Science and Technology, Shanghai 200237, China 

School of Science, East China University of Science and Technology, Shanghai 200231, China 

Research Center for Econophysics, East China University of Science and Technology, Shanghai 200231, China 

Research Center on Fictitious Economics & Data Science, Chinese Academy of Sciences, Beijing 100080, China 



PACS 89.65.Gh - Economics; econophysics, financial markets, business and management 

PACS 89 . 75 . Da - Systems obeying scaling laws 

PACS 05 . 45 . Df - Fractals 

PACS 05 . 65 . +b - Self-organized systems 

Abstract. - We perform a systematic investigation on the components of the empirical multi- 
fractality of financial returns using the daily data of Dow Jones Industrial Average from 26 May 
1896 to 27 April 2007 as an example. The temporal structure and fat-tailed distribution of the 
returns are considered as possible influence factors. The multifractal spectrum of the original 
return series is compared with those of four kinds of surrogate data: (1) shuffled data that contain 
no temporal correlation but have the same distribution, (2) surrogate data in which any nonlinear 
correlation is removed but the distribution and linear correlation are preserved, (3) surrogate data 
in which large positive and negative returns are replaced with small values, and (4) surrogate 
data generated from alternative fat-tailed distributions with the temporal correlation preserved. 
We find that all these factors have influence on the multifractal spectrum. We also find that the 
temporal structure (linear or nonlinear) has minor impact on the singularity width Aa of the 
multifractal spectrum while the fat tails have major impact on Aa, which confirms the earlier 
results. In addition, the linear correlation is found to have only a horizontal translation effect on 
the multifractal spectrum in which the distance is approximately equal to the difference between 
its DFA scaling exponent and 0.5. Our method can also be applied to other financial or physical 
variables and other multifractal formalisms. 
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Introduction. — There are a wealth of studies show- 
ing that financial markets exhibit multifractal nature 
[1-3]. Many different methods have been applied to char- 
acterize the hidden multifractal behavior in finance, for in- 
stance, the fluctuation scaling analysis [4 6], the structure 
function (or height-height correlation function) method 
[1,7-17], the multiplier method [18], the multifractal de- 
trended fluctuation analysis (MF-DFA) [19-27], the par- 
tition function method [28-38], the wavelet transform ap- 
proaches [39-42], to list a few. There are also efforts seek- 
ing for applications of the extracted multifractal spectra. 
Some researchers reported that the observed multifractal 
singularity spectrum has predictive power for price fluc- 
tuations [29,31,38], can serve as a measure of market risk 
by introducing a new concept termed multifractal volatil- 
ity [35], and can be used to quantify the inefficiency of 
markets [43]. 



An important and subtle issue of multifractality is about 
its origin. An even critical question is to ask whether the 
extracted multifractality is intrinsic or apparent. Indeed, 
it has been shown that an exact monofractal financial 
model can lead to an artificial multifractal behavior [44]. 
It is usually argued in the Econophysics community that 
the sources of multifractal nature in financial time series 
are the fat tails and/or the long-range temporal correla- 
tion [19]. However, possessing long memory is not suf- 
ficient for the presence of multifractality and one has to 
have a nonlinear process with long-memory in order to 
have multifractality [45]. In many cases, the null hypoth- 
esis that the reported multifractal nature is stemmed from 
the large price fluctuations cannot be rejected [46]. 

In this Letter, we focus on the multifractal detrended 
fluctuation analysis of financial logarithmic returns de- 
fined as 
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r(t)=\n[P(t)/P(t-l)} 



(1) 
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where P(t) is the price at time t. Specifically, we use the 
daily data of the Dow Jones Industrial Average (DJIA) 
from 26 May 1896 to 27 April 2007 (totally 30147 trading 
days) to illustrate the method and results. The reason is 
simply that most studies in this direction use MF-DFA 
on stock returns. Nevertheless, the methodology is quite 
general and also applies in the study of other financial 
variables and other multifractal analysis. 

The most studied factor is the temporal correlation in 
the return series, where the singularity spectrum of the 
real data is compared with that of the randomly shuffled 
data [20-22,42,47,48]. All these studies show that the 
shuffled data have non-shrinking singularity width Aa and 
the vertex (a, / (a)) with q = may shift left more or less. 
These observations imply that the heavy-tailed distribu- 
tion of the returns has crucial impact on the singularity 
width. Extensive numerical experiments using uncorre- 
cted time series obeying q-Gaussian distributions with 
different tail exponents unveil a convergence to monofrac- 
talilty in the Gaussian attraction basin and to bifractality 
in the Levy attraction basin [49] , which is consistent with 
the analytic derivation of truncated Levy flights [50] . Sim- 
ilar phenomena are observed for exponential distributions 
in the partition function framework [51]. To understand 
the impact of the distribution, one can either remove the 
large positive and negative returns [52] or generate sur- 
rogate data having a Gaussian distribution while keeping 
the linear correlation of the original data [26,27,53]. In 
this Letter, we will systematically investigate these fac- 
tors together with a new factor reflecting possible hidden 
patterns in the raw time series. 

Memory effect. - We adopt the MF-DFA method 
to obtain empirically the singularities a and the corre- 
sponding spectrum /(a) for each time series [19]. In all 
the cases, the scaling range of the detrended fluctuation 
function with respect to the time scale is [30,3000], the 
moment order q varies from -5 to 5, and the second-order 
polynomial (parabolic) is used for detrcnding.The deter- 
mination of the scaling range is a subtle issue since the 
intrinsic scaling behavior will be masked or deformed by 
high-order trends or nonlincarity [54-56] . The power laws 
of the detrended fluctuation functions arc not perfect. 
However, we find that the resulting multifractal spectra 
do not change much when we using a much narrower scal- 
ing range [30, 300]. When the time scale is 3000, there are 
only 20 boxes from both ends of the time series. To gain 
better statistics, we modify slightly the MF-DFA algo- 
rithm in the partitioning of boxes for a given time scale. 
Consider a time series {r(t)\t = 1,2, •■■ ,N}. For time 
scale s, we select a random sequence {ji\i = 1, 2, ■ ■ • , n}, 
which are uniformly distributed in [l,i — s + 1]. In our 
analysis, we use n = 2000. This sequence determines n 
boxes + s — 1], in which the locally detrended fluc- 

tuation functions Fi(s) are calculated. If the time series 
is long-term power-law correlated, F (s) scales as a power 



law of s [19] 

f n Y h 

F ( s )H-D F *( s )] 9 j ~ sHq) ( 2 ) 

The mass scaling exponent r(q) in the partition function 
formulism can be determined as 

r(q) = qh(q) - 1 (3) 

and the singularity strength a and its spectrum f(a) can 
be calculated according to the Legendre transform [57] 

J a = h(q) - qh'(q) 

\ f(a) = q[a-h(q)} + l ' W 

The multifractal spectrum f rig(o-) of the return series 
is thus determined and shown in fig. [TJ We then shuffle 
the return series 100 times and determine their singularity 
spectra / s huf(a)- For each point on the / s huf(o:) curve of 
the shuffled data, a and / s huf(a:) are the arithmetic aver- 
ages of the respective 100 values of the shuffled data and 
the error bar is the corresponding standard deviation. The 
singularity width is Aa or i g = 0.22 for the original data 
and Aa s huf = 0.18 ± 0.04 for the shuffled data. These 
results are consistent with previous works. We also ob- 
serve that the surrogate data have negative dimensions 
for larger q, which implies that the surrogate data con- 
tains more randomness in large values of local detrended 
fluctuations [58,59]. 
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— Shuffled data: /shuf(a) 
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Fig. 1: Comparison of the multifractal spectra / or i g (a) for the 
original data and / s huf (a) for the shuffled data. 

Effect of distribution. — As we have shown above 
that, shuffling the return time scries docs not eliminate 
its multifractality. This is not surprising since there is no 
(or very weak) long-term correlation in the returns. Since 
the shuffled time series still exhibits a wide singularity 
spectrum, it is natural to conjecture that the distribution 
of returns has essential impact on the f(a) curve. We 
try to systematically test this conjecture using surrogate 
data. Two methods are adopted to generate surrogate 
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data. One method is to replace the returns having large 
magnitudes with random numbers drawn from a normal 
distribution [52]. The other method is to substitute the 
raw returns with data drawn from prescribed distributions 
by keeping the ranking order which is relevant to the phase 
randomization algorithm [60] but with some differences. 

The truncation method. The truncation method was 
originally proposed to study the impact of large positive 
and negative values on the multifractal singularity width 
Aa of the foreign exchange rate returns, where the returns 
with the magnitudes greater than Ma were eliminated and 
replaced by linear interpolations [52] , where a is the stan- 
dard deviation of the raw time series. For convenience, the 
resulting data are termed as truncated data. For the FX 
returns, the singularity width Aa of the truncated data 
increases as the normalized threshold M increases [52]. 
When the truncated time series is shuffled, the singular- 
ity width Aa decreases dramatically [52]. These analyses 
illustrate that large values in the FX returns have signif- 
icant impact on the width of multifractal singularity and 
the temporal structure of the truncated series becomes a 
stronger factor on its multifractality. 

We have followed this idea and constructed the trun- 
cated time series in a slightly different way. The substi- 
tute for returns with |r(f)| > Ma is a collection of re- 
turns re-sampled randomly from the return series with 
\r(t)\ < Ma. We have generated 100 truncated data sets 
with the threshold Ma spanning from a to 13<r. The de- 
pendence of the singularity width Aat lun on M is illus- 
trated in fig. [21 It is clear that the width of the multi- 
fractal spectrum shrinks when the normalized threshold 
M decreases. For each vale of M, the truncated data are 
shuffled to generate 100 shuffled truncated data sets. For 
each shuffled truncated data set, we determine its singu- 
larity width Aa s ht r - The results are also presented in fig. [3] 
for comparison. On average, we find that Aa s ht r < Aatmn 
with some exceptions where Aa s htr = Aa trU n within the 
error bars. This phenomenon is similar to the case of FX 
returns [52]. 

Surrogate data with different PDFs. An alternative 
method to generate surrogate data is to replace the raw 
data by random numbers drawn from a prescribed dis- 
tribution. A similar idea was implemented to investigate 
the sources of multifractality in the returns of the Ira- 
nian rial - US dollar exchange rate [53], where the phase 
randomization algorithm was adopted and the surrogate 
data have a Gaussian distribution while keeping the lin- 
ear correlation of the original data [60]. Our algorithm 
for generating surrogate data is described as follows. For 
a given distribution, we generate a sequence of random 
numbers {xo(t)\t — 1,2, •■■ , T}, which are rearranged 
such that the rearranged series {x(t)\t = 1, 2, • • • , T} has 
the same rank ordering as the return series {r(i)|t = 
1, 2, ••• ,T}. In other words, x(t) should rank n in se- 
quence {x(t)\t = 1, 2, • • • , T} if and only if r(t) ranks n 
in the {r(t)\t = 1,2, •• • , T} sequence [61,62]. The series 
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Fig. 2: Dependence of the singularity width Aa on the nor- 
malized threshold M for the truncated time series from which 
returns with \r(t)\ > Ma have been deleted and replaced 
(Aatrun) and for the shuffled truncated data (Aa a htr). 



{x(t}} is rescaled to have the same standard deviation a 
of the returns {r(t)}: 



x(t) — > x(i) x a/a x + fx, 



(5) 



where a x is the standard deviation of {x(t)} and \x is the 
sample mean of {r(t)}. 

In our analysis, we have investigated two types of dis- 
tributions. The first one is a family of "double" Weibull 
distributions 

(6a) 



p(x) = fix 
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where the shape parameter (3 describes the heaviness of 
the tails and we require that f3 < 1. The second one is a 
family of Student's t distributions 
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which have power-law tails with exponent 7. 

Fig. [3] compares the original return scries with four sur- 
rogate time series generated from a normal distribution, a 
Laplace distribution, a Weibull distribution with [3 = 0.5, 
and a Student distribution with 7 = 3, respectively. We 
can see that all the surrogate series exhibit similar cluster- 
ing phenomenon as the original returns. In other words, 
the volatility (absolute value) of the surrogate data has 
long-term correlation, which is still true for all the cases. 
This intriguing feature is very important since it is absent 
in the surrogate data according to the phase randomiza- 
tion algorithm. In addition, the two surrogate series with 
fat tails share more similarity with the original returns 
than those from the normal and Laplace distributions. 
Certainly, a close scrutiny of the time series will unveil 
differences in their finer structure. 

For the case of Weibull distributions, we investigate 11 
values of the exponent f3, varying from 0.45 to 0.95 with 
a spacing step of 0.05. For smaller f3 values, we And that 
the multifractal spectra are not stable due to the poor 
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Fig. 3: Comparison of the original return time series with its 
surrogates. The surrogate time series from bottom to top are 
generated from a normal distribution, a Laplace distribution, a 
Weibull distribution with /3 = 0.5, and a Student distribution 
with 7 = 3. The original return series is at the bottom and 
the four surrogate series have been vertically shifted for better 
visibility. 

statistics caused by extreme jumps in the time series. For 
each j3, we generate 100 surrogate time series and the aver- 
age multifractal spectrum is determined. The multifractal 
spectra for = 0.45, 0.55, 0.65, 0.75, 0.85 and 0.95 are 
illustrated in fig. 0Ja). It is evident that time series with 
heavier tails (or small (3) exhibits stronger multifractality. 
This quantitative dependence of the average singularity 
width Aq as a function of the exponent (3 is depicted in 
fig- lib). 
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Fig. 4: (a) Multifractal spectra of surrogate time series gen- 
erated from Weibull distributions with /3 = 0.45, 0.55, 0.65, 
0.75, 0.85 and 0.95, which have the same rank ordering as the 
original return series, (b) Dependence of the singularity width 
Aa as a function of the exponent (3. The error bars are the 
standard deviations for the 100 surrogate series. 

For the case of Student distributions, we investigate 13 
values for the tail exponent 7, varying from 3 to 9 with 
a spacing step of 0.5. We choose the minimal value of 
7 = 3 since the returns at the transaction level are well 
modeled by Student's t distribution with 7 = 3 [63], which 



is the well-known inverse cubic law [64], and the tail ex- 
ponent increases with the time scale [1]. For each 7, we 
generate 100 surrogate time series and the average mul- 
tifractal spectrum is determined. Figure [5](a) illustrates 
the multifractal spectra for 7 = 3, 4, 5, 6, 7, 8 and 9. 
Again, it is evident that time series with heavier tails (or 
small 7) exhibits stronger multifractality. Comparing with 
fig. BJa), we find that the right parts of the f(a) curves 
for the Student's t distributions are closer to each other, 
which means that the small values characterized by nega- 
tive q's are more irregular. The quantitative dependence 
of the average singularity width Aa as a function of the 
exponent 7 is shown in fig. [5jb) . 
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Fig. 5: (a) Multifractal spectra of surrogate time series gen- 
erated from Student's t distributions with 7 = 3, 4, 5, 6, 7, 8 
and 9, which have the same rank ordering as the original return 
series. Dependence of the singularity width Aa as a function 
of the exponent /3. The error bars are the standard deviations 
for the 100 surrogate series. 

Hidden nonlinear structure. — In the above anal- 
ysis, we have investigated the impact of temporal corre- 
lation and probability distribution on the multifractal na- 
ture. More rigorously speaking, the shuffling approach is 
very aggressive, which not only removes linear correlations 
but also eliminates any hidden structure in the original re- 
turn series. It is thus interesting to assess the impact of 
hidden structure. This can be done with the help of surro- 
gate time series which has the same distribution and linear 
temporal correlation as the original data. We find that the 
DFA scaling exponent of the DJIA returns is H = 0.54. 
Although the DFA scaling exponent is very close to 0.5 for 
uncorrelated time scries, it may still contain some nontriv- 
ial information about the linear correlation in the original 
series. It is worth noting that this method will be more 
interesting for other financial time series whose DFA scal- 
ing exponent is significantly larger than 0.5 and thus have 
long memory. 

The algorithm for the generating of surrogate data 
is based on a simple iteration scheme called iter- 
ated amplitude-adjusted Fourier transform (IAAFT) [65], 
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which is an improved version of the phase randomization 
algorithm [60]. The return data {r(t)\t = 1,2, ■■■ , N} are 
sorted resulting in a new sequence {sat}, and we obtain 
the squared amplitudes of the Fourier transform of {sjv}, 
denoted as {S^}. The initial sequence {s^} of the iter- 
ation is a random shuffle of {sjv}- In the i-th iteration, 
the squared amplitudes {S^'^} of the Fourier transform of 

{sjv } are obtained and replaced by {S%}, which are trans- 
formed back, and then the resulting series are replaced by 
{sjv} but keeping the rank order. We generate 100 sur- 
rogate time series and perform the iteration 20 times for 
each surrogate series. 

Fig. [5] plots the averaged multifractal spectrum / SU rr(aO 
of the surrogate series and the / or ig(o ! ) curve of the 
original return series as well. The singularity width is 
Acquit = 0.18 ±0.04 for the surrogate data, which is close 
to Aa or i g = 0.22 for the original data. For the right part 
where q > 0, the / SU rr(a) curve for surrogate data is em- 
braced by that for the original data. For the left part 
where q < 0, the two curves almost overlap with the error 
bar for a large part. We also observe that the surrogate 
data have negative dimensions, which implies that the sur- 
rogate data contain more randomness in large values of 
local detrended fluctuations [58,59]. 
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Fig. 6: Comparison of the multifractal spectra / or i g (a) of the 
original data and / SU rr(aO of the surrogate data. 

It is also interesting to compare the f(a) curve for the 
surrogate data in fig. [5] and for the shuffled data in fig. [TJ 
We find that these two curves / SU rr(o!) and / s huf(«) have 
the same shape with a horizontal shift. We have approxi- 
mately that 

/surr(a) = /shut (a + A a ), (7) 

where A Q = a surl , q =o — a s huf,<j=o ~ 0.04. Note that the 
value of A a is close to the difference of the two DFA 
scaling exponents, H SUII — i? s h u f = 0.04, where H surl = 
H ol -i g = 0.54 and H shn{ = 0.50. Since a(0) = h(0), this 
relation can be understood as a first-order approximation 
in the sense that a surr (0)-a shu f(0) = h Bn „(0) -h sbu { (0) « 
frsurr(2)-/i s huf(2) = H sun - - H shu f . The relation (JT]) means 
that the difference between the multifractal spectra of the 



surrogate and shuffled data is caused by the weak tempo- 
ral correlation remained in the surrogate data. In general, 
the linear correlation in the time series does not influence 
the shape of the singularity spectrum (and thus its width) 
but rather has a shift effect. 

To further quantify the presence of hidden nonlinear 
features in the return time series and their effect on the 
multifractal spectrum, some remarks are in order based 
on the magnitude and sign decomposition method [66,67]. 
The daily DJIA return series can be decomposed into a 
magnitude (or volatility) series and a sign series. We find 
that the DFA scaling exponent of the magnitude series 
is 0.87, confirming the well-known stylized fact that stock 
market volatility has strong long memory. In contrast, the 
magnitude series of the shuffled and the IAAFT surrogates 
are found to be uncorrelated. These facts confirm the 
presence of nonlinearity in the return time series. It is 
well established that the phase-randomized surrogates of 
heartbeat series have a vanishing width of singularity [68, 
69] , which is due to the fact that the heartbeat time series 
do not have fat tails, but decay exponentially [70]. 

Conclusions. — In summary, we have systematically 
studied the components of empirical multifractality in fi- 
nancial returns based on shuffled and surrogate data, tak- 
ing the daily data of DJIA for more than one hundred 
years as an example. We found that the distribution, the 
linear correlation, and the nonlinear structure have influ- 
ence on the singularity spectrum. When the degree of mul- 
tifractality is characterized by the singularity width Aa, 
we found that the temporal structure (linear correlation 
and nonlinearity) has minor impact while the fat-tailed 
distribution plays a major role, which is a confirmation of 
the earlier results. 

* * * 
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